Add kp-detect-v20 submission #159

Open
kprofundis wants to merge 2 commits into liamdugan:main from kprofundis:kp-detect-v20-submission

Conversation

@kprofundis

kp-detect-v20 — Kareem Elsamadicy (Independent Researcher)

Release date: 2026-05-25

Detector: kp-detect-v20

Summary

Refines the v15 baseline on RAID test rows by combining the v15 score with an ensemble of 3,445 'pure-true' proteins: three-atom AND-products whose precision on RAID training data is 100% (no voter has yet fired on a document of the wrong class). These voters were filtered from 1M random candidates using a strict falsification rule (Popper-style): a protein is kept only if every document that fires it has the same target class, and it is dropped the moment a single counter-example fires.
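The falsification rule described above can be sketched roughly as follows. This is a minimal illustration, not the submission's code: it assumes each document is represented as a set of binary "atoms", that a protein is a tuple of three atoms that fires when all three are present, and that a protein which never fires is dropped. The names `fires` and `filter_pure_proteins` are hypothetical.

```python
def fires(protein, doc_atoms):
    """A protein (tuple of atoms) fires when every atom is present in the document."""
    return all(atom in doc_atoms for atom in protein)


def filter_pure_proteins(candidates, docs, labels):
    """Keep candidates that fire at least once and only ever on a single class.

    candidates: iterable of 3-tuples of atoms (assumed representation)
    docs:       list of sets of atoms, one set per training document
    labels:     list of class labels aligned with docs
    Returns a dict mapping each surviving protein to the class it votes for.
    """
    kept = {}
    for protein in candidates:
        seen_class = None
        falsified = False
        for atoms, label in zip(docs, labels):
            if not fires(protein, atoms):
                continue
            if seen_class is None:
                seen_class = label      # first firing fixes the target class
            elif label != seen_class:
                falsified = True        # one counter-example is enough to drop it
                break
        if seen_class is not None and not falsified:
            kept[protein] = seen_class
    return kept


# Toy usage with made-up atoms and labels (1 = machine-generated, 0 = human).
docs = [{"a", "b", "c"}, {"a", "b", "c", "d"}, {"a", "b", "d"}, {"b", "c", "d"}]
labels = [1, 1, 0, 0]
candidates = [("a", "b", "c"), ("a", "b", "d"), ("b", "c", "d"), ("x", "y", "z")]
pure = filter_pure_proteins(candidates, docs, labels)
```

In this toy data only `("a", "b", "c")` survives: the next two candidates fire on documents of both classes, and the last never fires at all.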

Train OOF (5-fold, balanced n=27,900)

| Variant | AUC | log_loss |
|---|---|---|
| v15 base alone | 0.9870 | 0.5587 |
| v20 (v15 + 3,445 pure-true protein voters, blend alpha=0.40) | 0.9920 | 0.3179 |
| Δ | +0.005 | −43% |
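The PR does not say how the blend is computed. The sketch below assumes a plain convex combination in which `alpha` weights the voter ensemble, and pairs it with a standard binary log loss. It also re-derives the Δ row from the two reported rows. The function names and the direction of the `alpha` weighting are assumptions.

```python
import math


def blend(p_base, p_voters, alpha=0.40):
    """Assumed blend: (1 - alpha) * base score + alpha * voter score."""
    return [(1 - alpha) * b + alpha * v for b, v in zip(p_base, p_voters)]


def log_loss(y_true, p_pred, eps=1e-15):
    """Mean binary cross-entropy, with probabilities clipped away from 0 and 1."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(y_true)


# Re-deriving the delta row from the reported OOF numbers.
auc_delta = 0.9920 - 0.9870                        # absolute AUC change
rel_log_loss_change = (0.3179 - 0.5587) / 0.5587   # relative log_loss change
```

With the reported values, `auc_delta` is about 0.005 and `rel_log_loss_change` is about -0.43, which matches the Δ row.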

Expected lift on RAID test: modest AUROC lift over v15 baseline (96.44 → ~97), with substantial log_loss reduction from the strict-precision voter ensemble.

Contact: kelsamadicy@gmail.com

@github-actions

github-actions Bot commented Jun 5, 2026

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

kp-detect-v20

Release date: 2026-05-25

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 95.31 and a TPR of 86.77% at FPR=5% and 70.87% at FPR=1%.
Without adversarial attacks, it achieved AUROC of 96.26 and a TPR of 89.89% at FPR=5% and 73.01% at FPR=1%.
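For readers unfamiliar with the "TPR at FPR" figures, here is a minimal sketch of one common way to compute them: pick the score threshold from human-written documents so that at most the target fraction is flagged, then measure how many machine-generated documents exceed it. This is an illustration under that assumption, not the RAID evaluation code, and `tpr_at_fpr` is a hypothetical helper.

```python
import math


def tpr_at_fpr(human_scores, machine_scores, target_fpr=0.05):
    """Return (tpr, threshold) where the threshold caps the false-positive rate.

    Scores are assumed to be higher for text judged machine-generated.
    A document is flagged when its score is strictly above the threshold.
    """
    ranked = sorted(human_scores, reverse=True)
    # Number of human documents we may flag without exceeding the target FPR.
    allowed = math.floor(target_fpr * len(ranked) + 1e-9)
    threshold = ranked[allowed] if allowed < len(ranked) else float("-inf")
    flagged = sum(score > threshold for score in machine_scores)
    return flagged / len(machine_scores), threshold


# Toy usage with made-up scores.
human = [i / 100 for i in range(100)]   # 0.00, 0.01, ..., 0.99
machine = [0.5, 0.96, 0.97, 0.99]
tpr, threshold = tpr_at_fpr(human, machine, target_fpr=0.05)
```

Because flagging is strict, tied scores at the threshold are never counted as positives, so the realised FPR can fall below the target but never above it.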

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!
